An Empirical Study on Machine Learning for Tweet Sentiment Analysis
نویسنده
چکیده
Tweet sentiment analysis has been an effective and valuable technique in the sentiment analysis domain. As the most widely used approach for tweet sentiment analysis, machine learning algorithms work well on the sentiment classification, just as they have been successfully applied for many other purposes. In this thesis, we conduct a systematic and thorough empirical study on the machine learning algorithms for tweet sentiment analysis, and expect to provide a guideline for applying machine learning algorithms for tweet sentiment analysis. Based on our experiments, we found that the Support Vector Machine (SVM) and the Random Forest (RF) work better than Maximum Entropy (MaxEnt), Adaptive Boosting (AdaBoost) and Naive Bayes on tweet sentiment analysis. For the pre-processing methods, stop words removal can improve the performance of classifiers obviously, and the combination of bi-grams + SentiWordNet + Stop words removal is the most effective pre-processing method combination in our experiments.
منابع مشابه
Forecasting Stock Price Movements Based on Opinion Mining and Sentiment Analysis: An Application of Support Vector Machine and Twitter Data
Today, social networks are fast and dynamic communication intermediaries that are a vital business tool. This study aims at examining the views of those involved with Facebook stocks so that we can summarize their views to predict the general behavior of this stock and collectively consider possible Facebook stock price movements, and create a more accurate pattern compared to previous patterns...
متن کاملIf You are Happy and Know It . . . Tweet
Extracting sentiment from Twitter data is one of the fundamental problems in Social Media Analytics. The length constraint of Twitter, an average of about six words per message, renders determining the positive or negative sense of a tweet difficult even for a human judge. In this work we present a general framework for single tweet (in contrast with batches of tweets) sentiment analysis which ...
متن کاملAutomatically Building a Corpus for Sentiment Analysis on Indonesian Tweets
The popularity of the user generated content, such as Twitter, has made it a rich source for the sentiment analysis and opinion mining tasks. This paper presents our study in automatically building a training corpus for the sentiment analysis on Indonesian tweets. We start with a set of seed sentiment corpus and subsequently expand them using a classifier model whose parameters are estimated us...
متن کاملSenti.ue: Tweet Overall Sentiment Classification Approach for SemEval-2014 Task 9
This document describes the senti.ue system and how it was used for participation in SemEval-2014 Task 9 challenge. Our system is an evolution of our prior work, also used in last year’s edition of Sentiment Analysis in Twitter. This system maintains a supervised machine learning approach to classify the tweet overall sentiment, but with a change in the used features and the algorithm. We use a...
متن کاملStream-based active learning for sentiment analysis in the financial domain
Studying the relationship between public sentiment and stock prices has been the focus of several studies. This paper analyzes whether the sentiment expressed in Twitter feeds, which discuss selected companies and their products, can indicate their stock price changes. To address this problem, an active learning approach was developed and applied to sentiment analysis of tweet streams in the st...
متن کامل